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ABSTRACT 

Initial investigations were made into the feasibility 
of estimating populations through a multiple regression technique 
during summer 1972, There was a relationship between population and 
the 2 independent variables tested — school enrollment and residential 
electrical hook-ups-~when allowance was made for size through the use 
of dummy variables. The sample was limited to 14 communities. 
Research continued into fall 1972 with the sample size increased to 
35 communities. The expanded sample vas tested to see if a 
relationship r-till existed between the population and electrical 
hook-ups and school enrollment. Since satisfactory results could. not 
be obtained using just the variables, dummy variables were added as 
independents. The problem was then approached from a regional 
perspective^ using 5 basic regions which were defined on • the basis of 
geographical proximity. An inherent weakness was that 4 regions 
included 5 or less communities. Thus, the simple approach to 
calculating people per residential electrical hook-up was used for 
the following dates: January 1970; January 1972; and January 1973, 
Thirty-six communities were used in January 1973. This report 
presents these telephone, electrical, and school data on' appendix 
form. (NQ) 
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INTRODUCTION 



Last summer, some initkil investigations were made into the feasibility of estimating 
populations through a multiple regression technique. The results, reported in SUMMARY 
REPORT ON SMALL COMMUNITY ANALYSIS, were significant enough to warrant 
further study of the method. * 

Research was continued into fall, 1972. Sample size was increased and alternative regression 
relationships were tested. The results have not been encouraging. While significant 
relationships have been ['ou\k\. none of the multiple regression equations yielded results 
with the precision necessary for confidently applying them in the numerous small, 
nonmetropolitan communities in the state. 

The results of the various regression approaches tried are presented in this paper. In 
addition, the results of the more simplistic approacli of people per utility hook-up are 
presented at the community level. It appears as thougli this type of approach, while the 
simplest, is also tlie recommended. 

A by-product of this study has been the generation of significant amounts of consistent 
data on the communities used. In the hopes that tliese data will be of use to other 
researchers in their work, it is presented in summary fonn in an appendix. 
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TESTING FOR A SIZF: RHLATIONSI IIP 



The work done in the summer of 1972 concluded, that there was a relationship between 
population and the two independent variables tested-school cnrohment and residential 
electrical hook-ups-when allowance was made for size through the use of dummy variables. 
The sample upon which these conclusions were based was limited, however, to 14 
communities. 

When the study was continued into the falL one of the first-^steps was to increase the 
sample size to 35. Data on utility connections and elementar\' school enrollment were 
collected for each community. The expanded sample* was tested to see *if a relationship 
still existed between the population figure and electrical hook-ups and school enrollment. 

Significant relationships were found-the best fit being expressed in the equation: 

Pop70 = 1.96 Sch7o + 2.14 UtilyQ 

(.587) (.351) I 

r2 = .988 

Standard Error of 

the Estimate = 1317.2 



•Pop = 5778.9 

Where Pop^Q = the estimated 1970 population 

■ Util-yQ = residential electrical hook-ups in 1970 
Sch-yQ ■ = elementary school enrollment in 1970 



While this regression yielded significant t statistics and a good coefficient of correlation, 
the standard error of the estimate was much too large for the precision needed in estimating 
populations for communities us small as 773. 

Several alternative sample sizes, which eliminated communities above or below certain 
levels, were tested to see if this would increase the precision of the results. Unfortunately, 
this did not occur. In no case were the percentage deviations of the predicted from the 
observed all within what was judged to be a reasonable range. 

For the purposes of these population -estimates. 10 percent was decided upon as the. 
maximum percentage deviation acceptable. 

Since satisfactorv' results could not be obtained using just the two independent variables, 
school enrollment and utility connections, a selection of dummy variables were added 
as independent variables. As this had produced satisfactory results during the summer, 
it was hoped it would do so again. 
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In each attempt to add dummy variables, t statistics were obtained where they were not 
significant at the 90 or. SO percent levels of* confidence. This '.says that segmenting 
communities within n sample according to size, did not better define a relationship. While 
these results were disappointing, the concept of a multiple regression approach was not 
yet abandoned, instead, it was decided to approach the problem from a different 
perspective-from a regional outlook. 



THE REGIONAL APPROACH 

Since satisfactory' resuUs could not be obtained through the previous methods, the regional 
approach was attempted.. Tiiis approach wlis intuitively appealing, if the state was divided 
into regions, it seemed plausible that communities in the same general area might exhibit 
similar functional relationships: that is, one miglit find the same, relationship between 
population, utilities, and enroUnient within a set of comn':unities. 

All of the communities for which data were available v/ere plotted on a map. Regions 
were then defmed on the basis of geographical proxiniity. Several communities were initially 
placed in two regions until proper placement could be deteniiined. Five basic regions were 
sketched out with a sixth potential region allowed for. As it turned out, the sixth region 
was not needed. , 

Separate regressionf; were nm for each region. The results were satisfactory. In four of 
the five regions, t statistics were significant at least the 90 percent level; which in the 
other region, the t statistic was significant at the SO percent level. In all regions, the 
standard errors of the estimate were much improved over previous methods. With the 
exception of three communities, percentage deviation of the predicted from the observed 
was under 10 percent. 

The regions as defined and used were as follows: 
Region I: 

*Pop7o= -578 + 1.24 Schyo + 2.68 Util7o 
(333) (.244) . 

r2 = .999 

Standard Error of 

the estimate = 223.9 



Pop = 5487.2 
COMMUNITIES: Tombstone, Bisbee, Douglas, Benson, Willcox 
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Region II: ' 

Pop7o = 323 + 2.75 Sch7Q +• 1.24 Util7o 
(.668) (.314) 

R- = .9995 

Standard Error of 

the Estimate = 168.5 



• Pop = 4141 
COMMUNITIES: Clifton-Morenci, Duncan, Safford, Thatcher 

Region III: 

Pop7Q = -261 + 2.11 Sch7Q + 2.09 Util7Q 

(.393) (.228) 
r2 = .9993 

Standard Error of 

the Estimate = 325.45 



Pop ^ 7259 

COMMUNITIES: Florence, Sommerton, Hayden-Winkelman, Miami 
Superior, Coolidge, Eloy, Globe, Casa Grande, 
Yuma 

Region IV: 

PoF'vo " 235.59 + 1.27 Sch7Q + 2.06 Util7Q ^ 

(.382) (.226) 

R^ = .9988 

Standard Error of 

the Estimate = 179.57 



Pop = 3871 



COMMUNITIES: 



Springervillc-Eagar, Sliowlow, Snowflake, 
Holbrook, Winslow 



Region V: 

Pop7Q = -1264 + 14.60 Sch7o - 3.18 Util7o 
r2 = .9975 

• Standard lirror of 
the Estimate * = 874.52 



Pop =7188 

COMMUNMTIES: Williams, Wickenburg, Parker, Cottonwood, 
Prescott, Flagstaff 



From a statistical point of view, these regression equations met the criterion necessary 
for acceptability. Despite the low degrees of freedom, the t statistics were sufficiently 
high to still be significant at SO or 90 percent confidence levels. It would have been simple 
at this point to generate 1972 population estimates based on these equations and call 
the work done. However, after the misleading results of the summer's work, a closer 
analytical look at the results seemed appropriate. 

The equations do seem to confirm a set of regional relationships. However, even given 
a regional relationship, the question can be raised as to whether or not these equations 
should be used to treety estimate population. It is an inherent weakness that four regions 
have six or less .communities in them. This results in only 1, 2, or 3 degrees of freedom- 
a definite constraint. 

Two solutions to this problem seem apparent. First, increase the number of communities 
within the region by gathering more data. This is unworkable for the data simply are 
not available. Second, combine regions so as to increase degrees of freedom. This was 
tried and the result was a breakdown in the relationship between population, school 
enrollment, and utility connections. 

Since the means of improving the degrees of freedom problem were stymied, one is faced 
with a set of regression ecjuations, statistically significant, that could be inaccurate due 
to too few observations. 

As a result, it v/as decided to use the findings selectively. The findings were interpreted 
to identify general regional similarities in the relatipnship between dependent ajid 
independent variables. However, specific population estimates were not generated from 
the equations due to the snv.iliness of sample size. Instead, the general knowledge of regions 
was combined with a more simplistic approach toward generating specific population 
figures. 



SIMPLE POPULATION LSTl MATHS 



Rather than produce no population estimntcs, the simple approach of calculating people 
per residential electrical liook-up in 1970 was used. The people per hook-up figure was 
applied to number of hook-ups on January 1, 1972, and 1973 to produce population 
estimates for three dates. 

These results arc presented in Table 1 for 36 communities in the state for which residential 
utility hook-up figures were available. In addition, average figures were calculated for each 
region as dellned in the regression equaiions. These are shown in Tiible II. While these 
averages were not significantly different v/hcn tested, it was felt they might be of use 
ill estimating population for communities with no 1970 population figures. 



DISCUSSION 



It is disappointing to find that an approach which appeared workable in preliminary testing 
is inappropriate for general application. The people per hook-up approach is not as 
satisfying as a regression approach would have been for it docs not recognize any functional 
relationships across communities in the state. Each community is treated on an individual 
basis; whereas, if the regression equation had proved applicable, functional relationships 
would have been established that could have been applied confidently on a broader basis 
tlian simply the sample communities. 

As it stands, the people- per hook-up approach appears to be the best alternative available. 
Its weaknesses are recongized, particulariy the continuing tendency of apartment complexes 
to have one utility hook-up for all apartments. Other alternatives simpk are not workable. 

The best alternative would be population estimates based on actual birth, death, and 
migration statistics. Migration estimation proves a problem at any level of analysis in 
Arizona; however, at the community level, even birth and death figures are problematic. 
This is due to the fact that included in births and deaths for a community are those 
figures on births and deaths occurrijig outside the city limit that are reported to the 
conimunity. 

One side product of this continued study, has been the large amount of statistical data 
which has been collected. Residential electrical hook-ups, elementary school enrollment 
figures within city limits, and residential telephone hoo.k-ups have been gathered for 
appro.Kimately 36 communities. The telephone figures were gathered for use in a second 
phase once the regression equations had been establislicd. They also were iricd .as 
independent variables in the regressions, but this was not successful. 

These telephone, electrical and school data are presented in this report in appendix form. 
One of the foremost proi^lems in analyzing any sniall community is the difficulty in 
obtainii^g data, and it is hoped tliat having these figures in one central volume will be a 
step towards allevialing the problem. 
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TABLE I 



People Per 

Utility Population Population Population 

Connection Anril 70 Est, Jan 72 Est. Jan 73 



Benson 


2. 85 


2. 


,839 


3; 


,311 


2; 


,867 


Bisbee 


3.08 


8 


,328 


8, 


675 


8 


,500 


Casa Grande 


3.94 . 


10 . 


,536 


1 1 ) 


,221 


12, 


,6 19 


Clarkdale- Jerorae 


•2.18 


1 , 


, 1 82 


1 , 


308 


1 , 


,168 


Clif ton-More nci 


2.67 


8 


,140 


6 , 


,9 12 


6 , 


,955 


Coolidge 


3.62 


■ 5 


,314 


5, 


,484 


5 


,756 


Cottonwood 


2.49 


2 


,815 


3, 


,244 


3 


,329 


Douglas 


3.47 


12 


,462 


12 , 


,70 3 


12, 


,637 • 


Duncan 


3.37 




773 




8 39 




775 


E ioy 


4.67 


5 


, 381 


5 


,89 3 


6 


,3.37 


Flagstaff 


3.51 


22 


,612* 


24, 


,373* 


24 


,892* 


Florence 


3.53 


2 


,173 


2 


,562 


2, 


,6 19 


Globe 


3.55 


7 


, 333 


7 


,490 ■ 


7 


,920 


Hayden-Winkelman 


3.21 


' 2 


,257 


2 


,275 


■ 2 


,314 


Ho lb rook 


3.74 


4 


,759 


4, 


,884 


4 


,9 44 


Kearny 


4.37 


2 


, 829 


3 


,056 


3 


,068 


Mammoth 


4.07 


1 


,953 


2 , 


,079 


2 


, 356 


Miami 


3.54 


3 


, 39 4 


3, 


,416 


3 


,463 


Page 


2.19 


1 


,4 39 


3 


,111 


3 


,177 


Parker 


2.53 


1 


,948 


2 


,074 


1 


,725 


Patagonia 


2.54 




6 30 




744 




662 


Pima 


3.1 4 


1 


, 1 84 


1 


,208 


1 


,275 


Pre SCO tt 


2.53 


12 


,700* 


13, 


,153* 


13 


,813* 


Saf ford 


3.25 


5 


, 333 


4 


,989 


5 


,429 


San Manuel 


3.81 


4 


, 332 


5 


,097 


5 


,246 


Shovf Low. 


3. 78 


2 


,129 


2 


,509 


. 2, 


,790 


Snouflake 


4. 37 


1 


,977 


2 


, 1 89 


2 


, 1 89 


Somm.erton 


4.54 


2 


,225 


2 


,655 


2 


,869 


Springe rville-E agar 


5.3 4 


2 


,430 


2 


,631 


2 


,785 


Superior 


3.45 


4 


,975 


5 


,164 


5 


,275 


Thatcher 


4. 29 


2 


,082* 


2 


,243* 


2 


,372* 


Tombs tone 


2.9 4 


1 


,241 


1 


r 390 


1 


, 387 


Willcox 


2.54 


2 


,568 


2 


,646 


2 


,085 


V7illiams 


3. 37 


2 


, 386 


2 


,4 70 


2 


,544 


Wins low 


2.95 


8 


,0 6 6 


8 


,009 


7 


,478 


Yuma 


3. 32 


29 


,007 


30 


,341 


31 


,274 



♦Population adjusted for institutional inmates and those in group 
quarters 
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TABLE II 



Regional Averages - People Per Utility Hookup 



Region Average 

I 2.97 
II 3.40 
III 3.70 
IV 3.63 

V* 2.9 4 
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*Wickenburg Omitted 
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APPENDIX A 



Residential Electrical Hookups 

January 19 70 January 1972 January 19 73 



Benson 


995 


1162 


1006 


Bisbee 


2 70 2 


2817 


2760 


Casa Grande 


2669 


2848 


320 3 


Clarkdale - Jerome 


540 


600 


536 


Cli f ton-Mo re nci 


30 39 


2589 


2605 


Coolidge 


1466 


1515 


1590 


Cottonviood 


1 1 30 


130 3 


1337 


Douglas 


3589 


3661 


3642 


Duncan 


229 


249 


2 30 


Eloy 


1151 


1262 


1357 


Flagstaff 


6431 


6944 


7092 


Florence 


615 


726 


742 


Gila Bend 


490 


514 


■ 517 


Globe 


2060 


2110 


2231 


Hay de n- Wink e Iman 


70 3 


70 9 


721 


Ho lb rook 


1272 


130 6 


1322 


Kearny 


647 


. 699 


70 2 


Mamrno th 


479 


511 


579 


Miami 


S58 


965 


9 79 


Page 


656 


1421 


1451 


Parker 


769 


820 


682 


Patagonia 


248 


293 


261 


Pima. 


376 


385 


406 


Prescott 


5010 


51 89 


5460 


Safford 


1 746 


1636 


1780 


San Manuel 


1 1 35 


1338 


1377 


Shov/ Low 


563 


6 64 


738 


Sierra Vista 


1 883 


239 1 


3552 


Snov; flake 


■ 452 


50 1 


50"l 


Sommerton 


490 


585 


6 32 


Springerville - Eagar 


726 


788 


883 


Superior 


1440 


149 7 


1529 


Thatcher 


485 


523 


553 


Tombstone 


421 


473 


472 


Willcox 


1009 


1042 


820 


Williams 


70 7 


733 


758 


Wi ns low 


2732 


2715 


2535 


Yuma 


8735 


91 39 


9420 
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APPENDIX B 

Elementary School Enrollment Within City Limits 





January 19 70 


January 19 72 


Benson 


515 


569 • 


Bisbee 


130 6 


1088 


Casa Grande 


2298 


19 0 5 


Clarkdole - Jerome 


216 


225 


Clifton - Morenci 


1441 


1446 


Coolidge 


1121 


1067 


Cottonwood 


469 


755 


Douglas 


2778 


2805 


Eldy 


1536 


1550 


Flagstaff 


3260 


39 34 


Florence 


585 


595 


Gila Bend 


363 


475 


Globe 


1806 


1755 


Havden — Winkelinan 


531 


49 1 


Ho lb rook 


1378 


1307 


Kearnv 


583 


39 3 


KincTinan 


16-39 


1651 


Maininoth 


435 


484 


Mie^ li 


671 


6 79 


Nogales 


2004 


20 78 


Pacre 

1. u. W 


331 


644 


Parker 


369 


373 


pinia 


2 86 


2 79 


P s r* o t "h 


.226 6 


2266 


fi^ "F'Ford 

C Cl JL JL w ^ 


1067 


9 92 


fian Manuel 

W4X X X X^\x X w4^P«* «^ 


49 5 


721 


Show Low 


527 


599 


Snov/ flake 


782 


893 


Sommerton 


557 


651 


Springerville - Eagar 


513 


547 


Superior 


1118 


1090 


Thatcher 


466 


4C0 


Tombstone 


471 


590 


Wickenburg 


259 


, ■ 2 82 


Willc.ox 


576 


■ 562 


VJilliams 


483 


46 4 


Wins low 


1767 


1647 


Yuma 


5214 


5177 
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Residential 


APPENDIX C 
Urban Telephone 


HookuDS 




Januairy 19 70 


January 19 72 


Ajo ' ■ 


1433 


1558 


Benson 


830 


980 


Bisbee 


2689 


2911 


Camp Verde 


572 


. 80 2 


Casa Grande 


2449 


2710 


Coolidge 


1524 


1750 


Cottonv/ood 


.1206 


1568 


Douglas 


318 8 


3550 


Duncan 


221 


30 7 


Eloy 


937 


1089 


Flagstaff 


5435 


6 4 79 


Florence 


538 


599 


Gila Bend 


320 


36 3 


Globe 


3226 


35 70 


Ho lb rook 


846 2 


9 30 2 


Joseph City 


101 


120 


Mammoth 


87 


99 


Miami 


14 34 


1564 


Nogales 


231 1 


2818 


Page 


335 


904 


Parker 


413 2 


448 2 


Patagonia 


129 


'176 


Pima 


254 


29 2 


Prescott 


4957 


5 82 8 


Saf ford 


1948 


20 85 


San Manuel 


832 2 


978 2 


St. Johns 


250 2 


275 2 


Show Low 


407 


509 


Snow flake 


502 2 


640 2 


Sommerton 


351 


405 


Springerville - Eagar 


449 


532 


Superior 


949 


1041 


Tombstone 


318 


39 9 


We 1 ton 


198 


178 


Wickenburg 


998 


1203 


Willcox 


780 


845 


Williams 


526 


572 


Wins low 


1969 


2044 


Yuma 


9600 


11441 



1 - Source: Mountain Bell unless othen'/is.e specified 

2 - Supplied by Western States Telephone 
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